A Multi-Layer Holistic Approach for Cursive Text Recognition

نویسندگان

چکیده

Urdu is a widely spoken and narrated language in several South-Asian countries communities worldwide. It relatively hard to recognize text compared other languages due its cursive writing style. The script belongs non-Latin family like Arabic, Hindi Chinese. written styles, among which ‘Nastaleeq’ the most popular used font A gap still poses challenge for localization/detection recognition of Nastaleeq as it follows modified version Arabic script. This research study presents methodology classify font, regardless position image. proposed solution comprised two-step methodology. In first step, detection performed using Connected Component Analysis (CCA) Long Short-Term Memory Neural Network (LSTM). second hybrid Convolution Recurrent (CNN-RNN) architecture deployed detected text. image containing binarized segmented produce single-line fed CNN-RNN model, recognizes saves file. technique outperforms existing ones by achieving an overall accuracy 97.47%.

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Dynamic Cursive Script Recognition: a Hybrid Approach

Consequently, a hybrid approach to recognition has been developed. The system described in this paper attempts to recognize entire words, but should it fail, it attempts to complete the word by consulting a lexicon. Words with identical beginnings are usually morphologically related. The system selects a similar word which fits the apparent size of the input. Even if the wrong form of the word ...

متن کامل

A Uniied Network-based Approach for Online Recognition of Multi-lingual Cursive Handwritings

Although several studies have focused on recognition of individual language, no attempt has been seriously made for online recognition of handwritten script in multiple languages. In this paper, a network-based approach is proposed for recognizing sequences of words in multiple languages. Viewing handwritten script as an alternating sequence of words and interword ligatures, a hierarchical hidd...

متن کامل

A Markovian Engine for Text Recognition: - Cursive Arabic Text, Statistical Features and Interconnected HMMs

This paper presents a cursive Arabic text recognition system. The system decomposes the document image into text line images and extracts a set of simple statistical features from a one-pixel width window which is sliding a cross that text line. It then injects the resulting feature vectors to Hidden Markov Models. The proposed system is applied to a data corpus which includes Arabic text of mo...

متن کامل

Offline Recognition of Large Vocabulary Cursive Handwritten Text

This paper presents a system for the offline recognition of cursive handwritten lines of text. The system is based on continuous density HMMs and Statistical Language Models. The system recognizes data produced by a single writer. No a-priori knowledge is used about the content of the text to be recognized. Changes in the experimental setup with respect to the recognition of single words are hi...

متن کامل

A Multi-Layer K-means Approach for Multi-Sensor Data Pattern Recognition in Multi-Target Localization

Data-target association is an important step in multi-target localization for the intelligent operation of unmanned systems in numerous applications such as search and rescue, traffic management and surveillance. The objective of this paper is to present an innovative data association learning approach named multi-layer K-means (MLKM) based on leveraging the advantages of some existing machine ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Applied sciences

سال: 2022

ISSN: ['2076-3417']

DOI: https://doi.org/10.3390/app122412652